Minimax Q-learning control for linear systems using the Wasserstein metric

Abstract

Stochastic optimal control usually requires an explicit dynamical model with probability distributions, which are difficult to obtain in practice. In this work, we consider the linear quadratic regulator (LQR) problem of unknown linear systems and adopt a Wasserstein penalty to address the distribution uncertainty of additive stochastic disturbances. By constructing an equivalent deterministic game of the penalized LQR problem, we propose a Q-learning method with convergence guarantees to learn the minimax controller.
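The abstract does not give the form of the deterministic game or of the learned Q-function. As a minimal sketch, assume the game's Q-function is quadratic, Q(x, u, w) = zᵀHz with z = [x; u; w], as is common for linear quadratic zero-sum games; the symbols H, K, L and the function name `minimax_gains` are illustrative assumptions, not notation from the paper. Under that assumption, the controller gain and the worst-case disturbance gain follow from the stationarity conditions of Q in (u, w):

```python
import numpy as np

def minimax_gains(H, n, m):
    """Gains (K, L) with u = -K x and w = -L x for an ASSUMED quadratic
    Q(x, u, w) = z^T H z, z = [x; u; w], where x has n entries and u has m."""
    H = 0.5 * (H + H.T)                      # work with the symmetric part
    H_uu, H_uw = H[n:n + m, n:n + m], H[n:n + m, n + m:]
    H_ww = H[n + m:, n + m:]
    # Saddle-point check: Q convex in u, and concave in w after eliminating u.
    schur = H_ww - H_uw.T @ np.linalg.solve(H_uu, H_uw)
    if np.any(np.linalg.eigvalsh(H_uu) <= 0) or np.any(np.linalg.eigvalsh(schur) >= 0):
        raise ValueError("H does not define a min-max saddle point in (u, w)")
    # Stationarity: H_ax x + H_aa [u; w] = 0  =>  [u; w] = -H_aa^{-1} H_ax x
    gains = np.linalg.solve(H[n:, n:], H[n:, :n])
    return gains[:m], gains[m:]

# Hypothetical scalar example (n = m = 1, one disturbance channel).
H = np.array([[2.0, 1.0, 0.5],
              [1.0, 3.0, 0.2],
              [0.5, 0.2, -4.0]])
K, L = minimax_gains(H, n=1, m=1)
```

In a model-free setting, H would be estimated from data (for example by least squares on observed transitions) before the gains are extracted; that estimation step is not shown here.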

Similar resources

P14: Anxiety Control Using Q-Learning

Anxiety disorders are among the most common reasons for referral to specialized clinics. If the response to stress is changed, anxiety can be greatly controlled. The most obvious effect of stress occurs on the circulatory system, especially through sweating. The electrical conductivity of the skin, in other words the Galvanic Skin Response (GSR), which depends on stress level, is used; beside this parameter pe...

Performance and robustness analysis of stochastic jump linear systems using Wasserstein metric

This paper focuses on the performance and robustness analysis of stochastic jump linear systems. Under stochastic jump processes, the realizations of the state trajectory become random variables, which gives rise to probability distributions for the system state. Therefore, a proper metric is necessary to measure the system performance with respect to stochastic switching. In this perspec...

Minimax Statistical Learning and Domain Adaptation with Wasserstein Distances

As opposed to standard empirical risk minimization (ERM), distributionally robust optimization aims to minimize the worst-case risk over a larger ambiguity set containing the original empirical distribution of the training data. In this work, we describe a minimax framework for statistical learning with ambiguity sets given by balls in Wasserstein space. In particular, we prove a generalization...

Dynamic clustering of histograms using Wasserstein metric

In this paper we present a new distance, based on the Wasserstein metric, for clustering a set of data described by distributions with finite continuous support. The proposed distance allows one to define a measure of inertia of the data with respect to a barycenter that satisfies the Huygens theorem of decomposition of inertia. Thus, this measure is proposed as allocation function in the dynami...

Journal

Journal title: Automatica

Year: 2023

ISSN: 1873-2836, 0005-1098

DOI: https://doi.org/10.1016/j.automatica.2022.110850